Extract metric handling from the desired balancer reconciler #120085

DiannaHohensee · 2025-01-14T01:50:01Z

The DesiredBalanceReconciler is responsible for applying updates to
the cluster states that reflect shard allocation changes towards a
DesiredBalance. It isn't the Reconciler's responsibility to handle
pushing APM metrics. This patch cleans up the Reconciler constructor
and logic by extracting metric handling, consolidating metric updates
at the Allocator level of the code instead of splitting them across the
two components.

I want to expand upon the metric collection, so I'm yanking it out of the Reconciler. Otherwise I'll have to push more objects into the Reconciler to create all the stats there, which isn't really the Reconciler's business. Precursor to plugging in the work started in #119916: once they're both in, I can start plugging in the new stats.
This is purely a refactor now.

The DesiredBalanceReconciler applies the desired balance to the routing table cluster metadata. This patch extracts the logic that pushes updates to the desired balance metrics out of the reconciler. This removes extra parameters passed into the DesiredBalanceReconciler, and enables expanding the metrics that are collected without passing more objects into the reconciler class. Relates ES-10341
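
As a rough illustration of the separation described above, the sketch below has a reconciler that only returns a summary of what it saw, while the allocator owns the metrics sink and publishes the summary. Every name in it (`ReconcileStats`, `MetricsSink`, `SketchReconciler`, `SketchAllocator`) is hypothetical and simplified; none of it is the actual Elasticsearch code from this patch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class ExtractMetricsSketch {

    // Hypothetical summary of one reconciliation pass (not the real Elasticsearch type).
    record ReconcileStats(int unassignedShards, int totalAllocations, int undesiredAllocations) {}

    // Hypothetical stand-in for whatever publishes APM metrics.
    interface MetricsSink {
        void publish(ReconcileStats stats);
    }

    // Compares current and desired placement and reports counts; it holds no metrics objects.
    static final class SketchReconciler {
        ReconcileStats reconcile(Map<String, String> currentNodeByShard, Map<String, String> desiredNodeByShard) {
            int unassigned = 0;
            int total = 0;
            int undesired = 0;
            for (Map.Entry<String, String> desired : desiredNodeByShard.entrySet()) {
                String current = currentNodeByShard.get(desired.getKey());
                if (current == null) {
                    unassigned++; // shard has no current node yet
                } else {
                    total++;
                    if (!current.equals(desired.getValue())) {
                        undesired++; // assigned, but not where the desired balance wants it
                    }
                }
            }
            return new ReconcileStats(unassigned, total, undesired);
        }
    }

    // Owns the metrics sink, so all metric updates live at the allocator level.
    static final class SketchAllocator {
        private final SketchReconciler reconciler = new SketchReconciler();
        private final MetricsSink metrics;

        SketchAllocator(MetricsSink metrics) {
            this.metrics = metrics;
        }

        void allocate(Map<String, String> current, Map<String, String> desired) {
            metrics.publish(reconciler.reconcile(current, desired));
        }
    }

    public static void main(String[] args) {
        List<ReconcileStats> published = new ArrayList<>();
        SketchAllocator allocator = new SketchAllocator(published::add);
        allocator.allocate(Map.of("s0", "n1", "s1", "n2"), Map.of("s0", "n1", "s1", "n3", "s2", "n1"));
        System.out.println(published);
    }
}
```

With this shape, adding a new metric only touches the stats record and the allocator; the reconciler constructor does not grow.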

elasticsearchmachine · 2025-01-14T01:50:25Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

nicktindall · 2025-01-14T04:01:35Z

...rc/main/java/org/elasticsearch/cluster/routing/allocation/allocator/DesiredBalanceInput.java

+    long index,
+    RoutingAllocation routingAllocation,
+    List<ShardRouting> ignoredShards,
+    BalancingRoundStats.Builder clusterAllocationStatsBuilder


I feel like it's a bit confusing how there's a stats builder on the input, but we only use it for the eager reconciliation (assuming I've understood the code right). Could we instead just create one specifically for the eager reconciliation to make it clear it's local to that process?

I'm not sure I understand. What part are you referring to as eager reconciliation? Is it this balance() method?

My intention with passing around a stats data structure is to collect additional statistics throughout the balancing round -- for example, I'll want balancing round start and end times, so I'll need stats tracking as early as this line before computation begins. A balancing round begins with a DesiredBalanceInput submitted to the ContinuousComputation object, which queues requests for a balancing round: the latest DesiredBalanceInput submission will be run async.
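
To make the queueing behaviour concrete, here is a minimal sketch of a "latest input wins" computation queue. It is a simplified, hypothetical model (`LatestInputComputation` is an invented name), not the real ContinuousComputation class, and it assumes the executor runs tasks one at a time.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Executor;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

class LatestInputSketch {

    // Simplified, hypothetical model of a queue where only the newest input is processed.
    static final class LatestInputComputation<T> {
        private final AtomicReference<T> enqueued = new AtomicReference<>();
        private final Executor executor; // assumed to run tasks one at a time
        private final Consumer<T> processor;

        LatestInputComputation(Executor executor, Consumer<T> processor) {
            this.executor = executor;
            this.processor = processor;
        }

        void onNewInput(T input) {
            // Replace any input that has not been picked up yet. Only schedule a
            // task if nothing was pending, because an already scheduled task will
            // see the newest value when it finally runs.
            if (enqueued.getAndSet(input) == null) {
                executor.execute(this::processLatest);
            }
        }

        private void processLatest() {
            T input = enqueued.getAndSet(null);
            if (input != null) {
                processor.accept(input);
            }
        }
    }

    public static void main(String[] args) {
        List<Runnable> scheduled = new ArrayList<>(); // manual executor: tasks run only when we say so
        List<Integer> processed = new ArrayList<>();
        LatestInputComputation<Integer> computation = new LatestInputComputation<>(scheduled::add, processed::add);
        computation.onNewInput(1);
        computation.onNewInput(2);
        computation.onNewInput(3);
        scheduled.forEach(Runnable::run);
        System.out.println(processed); // inputs 1 and 2 were overwritten before the task ran
    }
}
```

Anything attached to a skipped input, such as a stats builder carried on it, is never seen by the computation, which matters for the lifecycle question discussed in this thread.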

I mean in allocate() we call reconcile(...) with desiredBalanceInput.clusterAllocationStatsBuilder() but then in the continuous computation we create separate StatsBuilders to pass to submitReconcileTask, even though desiredBalanceInput is in scope.
I just wonder whether the statsBuilder belongs in desiredBalanceInput, but I find the code a bit hard to follow so maybe there's an obvious reason for it being there.
I think I'm struggling to understand the lifecycle of the builder.

Ahh. I think you found something buggy. So allocate will queue a DesiredBalanceInput for computation, which gets picked up asynchronously. But then reconcile() is run immediately and updates the metrics. Computation happens eventually.

I thought I was being thorough by passing the stats builder everywhere, but reconciliation always runs whereas computation only occasionally runs (inputs are queued, so an input gets skipped if a newer input arrives). And then computation will normally lead to reconciliation being called.

Okay... So allocate() needs two separate StatsBuilders, one for immediate reconciliation, and the other passed to computation and ultimately to reconciliation. Or alternatively we skip instrumenting computation and create fresh StatsBuilders for each reconciliation event.

So there will be some use of the StatsBuilder in the computation phase (is that what // TODO: this will be expanded upon shortly in ES-10341. refers to?)

If so, is it as simple as just:

- Removing StatsBuilder from the DesiredBalanceInput
- Creating one locally to pass to reconcile(...) in allocate(...)
- Using the existing local StatsBuilder in processInput?
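
The three steps suggested above could look roughly like the sketch below. It is only an illustration under assumptions: `StatsBuilder`, `BalanceInput`, and `LifecycleAllocator` are hypothetical, heavily simplified stand-ins, and the async computation is modelled as a plain method call.

```java
import java.util.ArrayList;
import java.util.List;

class StatsLifecycleSketch {

    // Hypothetical, simplified stats builder; the real builder's fields are not shown in this thread.
    static final class StatsBuilder {
        private int reconciliations;

        void recordReconciliation() {
            reconciliations++;
        }

        int reconciliations() {
            return reconciliations;
        }
    }

    // Step 1: the input no longer carries a stats builder.
    record BalanceInput(long index, List<String> ignoredShards) {}

    static final class LifecycleAllocator {
        final List<StatsBuilder> completed = new ArrayList<>();
        private BalanceInput pending;

        void allocate(BalanceInput input) {
            pending = input; // queue for a later computation; a newer input overwrites it
            StatsBuilder local = new StatsBuilder(); // Step 2: local to the immediate reconciliation
            reconcile(local);
        }

        // Stands in for the asynchronous computation picking up the latest input.
        void processInput() {
            if (pending == null) {
                return;
            }
            pending = null;
            StatsBuilder local = new StatsBuilder(); // Step 3: local to this computation round
            reconcile(local);
        }

        private void reconcile(StatsBuilder stats) {
            stats.recordReconciliation();
            completed.add(stats);
        }
    }

    public static void main(String[] args) {
        LifecycleAllocator allocator = new LifecycleAllocator();
        allocator.allocate(new BalanceInput(1, List.of()));
        allocator.allocate(new BalanceInput(2, List.of()));
        allocator.processInput();
        // Each reconciliation used its own builder, so none was updated twice.
        allocator.completed.forEach(stats -> System.out.println(stats.reconciliations()));
    }
}
```

Because every builder is created where it is used, its lifecycle is visible in one method and no builder is shared between the immediate reconciliation and a later computation.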

So I've updated this patch to be purely refactor work. It doesn't lead into new work anymore. I think it's cleaner without pushing desired metrics objects into the Reconciler to handle: this cleans up the constructor and purpose of the Reconciler.

pxsalehi · 2025-01-14T15:04:08Z

I find it hard to judge how useful the refactoring decisions here and in #119916 are without actually seeing what metrics are going to be collected. IMO, it would be good to see a draft PR (potentially w/o tests is also ok) which shows concretely what we're collecting and then work out what are reasonable pieces to separate out, if that ends up being too large to review. My experience adding some metrics here was that while trying to plug things in, you'd stumble upon different twists and wrinkles that might impact the structure. (Not sure how much has been discussed, but I find it hard to assess these refactoring PRs.)

DiannaHohensee · 2025-01-14T17:49:37Z

I find it hard to judge how useful the refactoring decisions here and in #119916 are without actually seeing what metrics are going to be collected.

Sure, here's a WIP PR for the summed changes. I have some notes on what to look at in that PR. Let me know what you think.

My experience adding some metrics here was that while trying to plug things in, you'd stumble upon different twists and wrinkles that might impact the structure. (Not sure how much has been discussed, but I find it hard to assess these refactoring PRs.)

Yes, the code is definitely tricky. It takes a lot of thinking to sort out what anything is right now.

I haven't really discussed the code implementation with anyone so far, only what I've tried to explain in the PRs. The design ticket, ES-10260, explains the high-level plan in detail, if you want to see where I'm headed.

pxsalehi · 2025-01-15T15:58:58Z

Thanks, I had a look at the WIP PR. I'm a bit lost following the changes, probably because: 1) the PR seems to be very general even though the specific metrics are still TODO. Any chance we could start small and gradually extend it? 2) There is a lot of noise that is not really a core change, e.g. new comments on existing code and renames, which makes it hard to focus on what the actual change is. Just two suggestions that would make reviewing easier for me personally. As for the way stats are collected, I think I agree that passing around a builder like that is a bit confusing, and we probably shouldn't add it to the input like that. Do the gathered metrics all need to be centralized like that?

DiannaHohensee · 2025-01-16T19:25:45Z

I've realized that my understanding of the code was incorrect. Reconciliation only applies a handful of changes from the current desired balance computation, until it reaches throttling saturation (e.g. each node can only participate in X concurrent shard recoveries). So, as an example, the reconciler might stop assigning shards in allocateUnassigned, and still run moveShards and balance, but those methods won't do anything further.
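
A minimal sketch of why reconciliation only applies part of a desired balance: once a node hits its recovery limit, further moves targeting it are left for a later pass. This is an assumption-laden simplification (`DesiredMove` and `ThrottledReconciler` are hypothetical, and only the target node is throttled here, whereas the real throttling rules are more involved).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ThrottleSketch {

    // Hypothetical desired move of one shard onto a target node.
    record DesiredMove(String shard, String targetNode) {}

    static final class ThrottledReconciler {
        private final int maxConcurrentRecoveriesPerNode;

        ThrottledReconciler(int maxConcurrentRecoveriesPerNode) {
            this.maxConcurrentRecoveriesPerNode = maxConcurrentRecoveriesPerNode;
        }

        // Returns the moves started in this pass; throttled moves stay pending for a later pass.
        List<DesiredMove> reconcile(List<DesiredMove> desiredMoves) {
            Map<String, Integer> recoveriesByNode = new HashMap<>();
            List<DesiredMove> started = new ArrayList<>();
            for (DesiredMove move : desiredMoves) {
                int inFlight = recoveriesByNode.getOrDefault(move.targetNode(), 0);
                if (inFlight >= maxConcurrentRecoveriesPerNode) {
                    continue; // target node is saturated, skip this move for now
                }
                recoveriesByNode.put(move.targetNode(), inFlight + 1);
                started.add(move);
            }
            return started;
        }
    }

    public static void main(String[] args) {
        ThrottledReconciler reconciler = new ThrottledReconciler(2);
        List<DesiredMove> desired = List.of(
            new DesiredMove("s0", "n1"),
            new DesiredMove("s1", "n1"),
            new DesiredMove("s2", "n1"),
            new DesiredMove("s3", "n2")
        );
        // The third move onto n1 is throttled, so fewer moves start than the desired balance contains.
        System.out.println(reconciler.reconcile(desired));
    }
}
```

Stats gathered inside such a pass describe only the started moves, not the full change between desired balances.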

So I'll need to instrument the computation, not reconciliation. I'm investigating if I can compare old to new desired balance here to get the stats.
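
One way to picture "compare old to new desired balance" is a simple diff over shard-to-node assignments. The sketch below is an assumption, not the approach the PR ended up with: it models a desired balance as a map from a shard name to the set of node ids that should hold a copy, and `countNewAssignments` is a hypothetical helper.

```java
import java.util.Map;
import java.util.Set;

class DesiredBalanceDiffSketch {

    // Counts (shard, node) assignments present in the new balance but absent from the old one,
    // i.e. shard copies that would have to be placed on a node they were not assigned to before.
    static int countNewAssignments(Map<String, Set<String>> oldBalance, Map<String, Set<String>> newBalance) {
        int changed = 0;
        for (Map.Entry<String, Set<String>> entry : newBalance.entrySet()) {
            Set<String> before = oldBalance.getOrDefault(entry.getKey(), Set.of());
            for (String node : entry.getValue()) {
                if (!before.contains(node)) {
                    changed++;
                }
            }
        }
        return changed;
    }

    public static void main(String[] args) {
        Map<String, Set<String>> oldBalance = Map.of("s0", Set.of("n1", "n2"), "s1", Set.of("n1"));
        Map<String, Set<String>> newBalance = Map.of("s0", Set.of("n1", "n3"), "s1", Set.of("n2"), "s2", Set.of("n1"));
        System.out.println(countNewAssignments(oldBalance, newBalance));
    }
}
```

Unlike counts taken during reconciliation, this number does not depend on how many moves throttling allowed in a given pass.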

I think refactoring the reconciler to pull out stats reporting still makes sense and makes things more modular, but I'll see how it looks.

DiannaHohensee · 2025-01-17T18:35:26Z

Slowly making progress here. I've filed ES-10581 as an extra twist.

DiannaHohensee

Alrighty, I've refactored my change to only extract the logic out of the Reconciler, keeping all the desired balance metric update logic in the Allocator.

DiannaHohensee · 2025-01-27T22:05:52Z

So I've updated this patch to be purely refactor work. It doesn't lead into new work anymore. I think it's cleaner without pushing desired metrics objects into the Reconciler to handle: this cleans up the constructor and purpose of the Reconciler.

DiannaHohensee · 2025-02-05T14:59:32Z

I'm closing this -- since there hasn't been much activity and it's a bit messy from the pivot -- and replacing it with #121771.

DiannaHohensee added the >non-issue, :Distributed Coordination/Allocation, and Team:Distributed Coordination labels Jan 14, 2025

DiannaHohensee self-assigned this Jan 14, 2025

elasticsearchmachine added the v9.0.0 label Jan 14, 2025

DiannaHohensee requested review from nicktindall and pxsalehi January 14, 2025 01:53

fix link

7f24a2b

nicktindall reviewed Jan 14, 2025

DiannaHohensee requested a review from nicktindall January 14, 2025 17:26

DiannaHohensee mentioned this pull request Jan 14, 2025

(WIP) Plug in balancer round summary stats #120135

Closed

DiannaHohensee and others added 4 commits January 27, 2025 16:15

Merge branch 'main' into 2025/01/10/ES-10341-extract-metrics-handling-from-reconciler

c9ff4f1

revamp, purely a refactor

c57565e

[CI] Auto commit changes from spotless

133ea3a

touchups

7fcf9fe

DiannaHohensee commented Jan 27, 2025

DiannaHohensee added >tech debt and removed >tech debt labels Jan 27, 2025

elasticsearchmachine added v9.1.0 and removed v9.0.0 labels Jan 30, 2025

Merge branch 'main' into 2025/01/10/ES-10341-extract-metrics-handling-from-reconciler

68b2f97

DiannaHohensee closed this Feb 5, 2025
